NEUOM: Identifying Opinionated Sentences in Chinese and English Text
نویسندگان
چکیده
NEUOM: Identifying Opinionated Sentences in Chinese and English Text Zhang, Ke Wang, Muhua Zhu, Tong Xiao, Jingbo Zhu Natural Language Processing Lab, Northeastern University {zhangcl, xiaotong, zhujingbo}@mail.neu.edu.cn [email protected], [email protected] Abstract This paper introduces our NEUOM system which participates in the opinionated sentence detection task, one of evaluation tasks in Multilingual Opinion Analysis Task (MOAT) of NTCIR-7. NEUOM system adopts a sentiment lexicon-based(SLB) approach to identifying opinionated sentences in a Chinese text and English text. For English task, a machine learning algorithm, naïve Bayesian classification model, is also tried with the use of the English training corpora, such as MPQA and NTCIR-6 data set. Experimental results show that in the English task SLB method achieved better F1 performance than Naïve Bayesian model.
منابع مشابه
zNLP: Identifying Parallel Sentences in Chinese-English Comparable Corpora
This paper describes the zNLP system for the BUCC 2017 shared task. Our system identifies parallel sentence pairs in Chinese-English comparable corpora by translating word-by-word Chinese sentences into English, using the search engine Solr to select near-parallel sentences and then by using an SVM classifier to identify true parallel sentences from the previous results. It obtains an F1-score ...
متن کاملNTCIR-6 at Maryland: Chinese Opinion Analysis Pilot Task
For the Chinese opinion analysis pilot task at NTCIR-6, we tested two techniques for each of the four subtasks—identifying opinionated sentences, making polarity decisions, identifying opinion holders, and retrieving topically relevant sentences. Our opinion detection technique is based on sentiment lexicons. We explored three main issues: the effect of the size of sentiment lexicons on the acc...
متن کاملSentence-Level Opinion Analysis for Chinese News Documents Based on Sen- timent Information of Social Tags
Social tags have been considered to indirectly reflect authorized opinions of taggers. This paper proposes an unsupervised method which derives implicit sentiment information from social tags to decide, in one document, which sentences are opinionated, as well as to annotate them with proper polarity labels. First, for a social tag, its opinion degree is measured by aggregating the opinion degr...
متن کاملOpinion Annotation in On-line Chinese Product Reviews
This paper presents the design and construction of a Chinese opinion corpus. Based on the observation on the characteristics of opinion expression in Chinese online product reviews, which is quite different from in the formal texts such as news, an annotation framework is proposed to guide the construction of an opinion corpus based on online product reviews. The opinionated sentences are manua...
متن کاملIdentifying Opinionated Sentences
In the news, editorials, reviews, and letters to the editor are sources for finding opinions, but even in news reports, segments presenting objective facts are often mixed with segments presenting opinions and verbal reactions. This is especially true for articles that report on controversial or “lightning rod” topics. Thus, there is a need to be able to identify which sentences in a text actua...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2008